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Some of the major results of future goals of an 
automatic program synthesis project are described in the two papers 
that comprise this document. The first paper gives a detailed 
algorithm for synthesizing a computer program from a trace of its 
behavior. Since the algorithm involves a search, the length of time 
required to do the synthesis of nontrivial programs can be quite 
large. Techniques are given for preprocessing the trace information 
to reduce enumeration, for pruning the search using a failure memory 
technique, and for utilizing multiple traces to the best advantage. 
The results of numerous tests are given to demonstrate the value of 
the techniques. The other paper gives a brief overview of the 
automatic programming system within which the above algorithm is 
being used. This system is still under development. (Author/CH) 
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PREFACE 

The two papers included here describe some of the major results of our 
automatic program synthesis project and indicate some of our goals for the 
future. The first paper gives a detailed and very precise description of our 
algorithm for program synthesis from computation traces. The other paper gives 
a brief overview of the automatic programming system within which the above 
algorithm is being used. 

The research reported in the first paper was done partly in the Computer 
Science Department, Stanford University and was supported by the Advanced Research 
Projects Agency of the Office of the Secretary of Defense (SD-183) and the 
National Science Foundation Grant No. GJ-776. Part of the work was done in the 
Computer and Information Science Research Center at the Ohio State University and 
was supported by Grant Numbers GN-534.1 and GJ-34739X from the National Science 
Foundation. 

The second paper was done in the Computer and Information Science Research 
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Speeding up a Program Synthesizer 



Alan W. Biermann, Richard Baum, Frederick E. Petry 

Department of Computer and Information Science 
The Ohio State University 



Abstract 

An algorithm is given for synthesizing a computer program from a trace 
of its behavior. Since the algorithm involves a search, the length of time 
required to do the synthesis of nontrivial programs can be quite large. 
Techniques are given for preprocessing the trace information to reduce enu- 
meration, for pruning the search using a failure memory technique, and for 
utilizing multiple traces to the best advantage. The results of numerous 
tests are given to demonstrate the value of the techniques. 



keywords: program synthesis, inference, learning, program inference, finite 
state machine synthesis, incompletely specified machines 



I. INTRODUCTION 

The two most important problems to overcome before a practical auto- 
matic program synthesizer can be developed are 

(1) how to design the input to the synthesizer so that it is in a 
form easily used by human beings, and 

(2) how to speed up the synthesis through intelligent use of the input 
information so that large programs can be synthesized in a practical 
amount of time. 

Since it is always possible to simply enumerate the set of all programs until 
one is found which satisfies the given conditions, the question is not whether 
the program can be synthesized but whether it can be discovered after a reason- 
able amount of computation. The designer of an automatic program synthesizer, 
therefore, needs a technique for transmitting information from the human user 
to the machine in a form which is easily used by both. This paper will suggest 
that example computations may be a proper vehicle for such man-machine communi- 
cations. 

We know that when humans communicate ideas to each other, examples 
play a primary part. It is difficult, for example, to imagine any kind of 
explanation of how to write a program or do a particular calculation without 
including some particular cases of how the process is done. Since teaching 
and learning by example seem to be a natural mode of communication for humans, 
it does not seem unreasonable to believe that a man and a machine might com- 
municate in the same way. 

Recent results in inference theory [9, 10, 11, 12, 14,15,17] show that 
not only can man learn from examples, but machines can also. With regard 
to the program synthesis problem, Biermann [91 has given an algorithm for 
generating programs from computation traces, and Raulefs [26] and Barzdin 
[6] have also studied the problem. These papers seem to indicate that correct 



programs can be synthesized on the basis of relatively little input information, 
often only one sample computation, but that the amount of time required to do 
the synthesis grows very quickly as a function of program complexity. Typical 
computation times for constructing the one, three, and five state Turing machine 
controllers given in [9] were about one, five, and one hundred seconds, 
respectively. The implications are clearly that programs of practical interest 
will never be synthesized unless either the old methods are speeded up or new 
ones are developed. This paper will present a number of techniques for speeding 
up the synthesis process and will show that most of the searching done by the 
algorithm in [9] is, in fact, unnecessary. Many of the examples that were done 
in that research involved long searches with dozens or hundreds of backups 
whereas often only two or three were necessary. The greatly increased power 
of the techniques discussed here makes it possible to synthesize programs of 
significant complexity in just a few seconds of time. 

The goal of our research is the development of what we call an autopro- 
gramming system. Such a system gives the user a facility for the easy generation 
of example computations. We are currently c~ing a computer display system on 
which appear data structures declared by the user and the commands available 
for doing a calculation. The user executes an example calculation by referencing 
the commands and their operands (among the data structures) with a light pen 
in scratch pad fashion. The contents of the data structures are continuously 
updated as the calculation proceeds. While the example is being carried out, 
the machine records the sequence of commands, and later, after one or several 
examples have been completed, it constructs the shortest computer program which 
is consistent with these computation traces . As an illustration, the user might 
spend several minutes sorting by hand with the light pen the list of integers 
(3, 2, 1, 4) using some algorithm. After a fraction of a second of computing, 
the machine would type out a general program for the algorithm which sorts N 
integers. This example is explained in more detail in Section V. 



If the synthesized program should exhibit some shortcomings, the individual 
could input additional computations wh' A\ would force the system to appropriately 
revise the generated program. It is well known [9, 18] that any program (or its 
equivalent) may be generated in this manner. 

This paper will discuss the core of the autoprogramming system, the pro- 
gram synthesizer, and another paper [13] will describe the autoprogramming lan- 
guage that we have implemented and its operation. Fortunately, the program synthe- 
sizer is quite language independent and so no details of any language need be 
discussed here. 

It is important that our goals in this work not be confused with those 
of many of the well known researchers on program synthesis methods: Amarel 
[1, 2], Balzer [5], Waldinger and Lee [29], Manna and Waldinger [23] and others 1 . 
These individuals are interested in designing systems which actually synthesize 
the algorithm for doing the problem from very weak input information such as 
input-output pairs or a formal specification of the desired performance. Such 
work is extremely important from the artificial intelligence point of view because 
it addresses the problems of what is thought, how should knowledge be represented, 
and so forth. Our interest however is in man-machine communication, and we assume 
that the user already knows how to solve the problem. Our goal is not to auto- 
matically synthesize an algorithm but rather to provide an easy means for trans- 
mitting the user's concept of the algorithm to the machine. We suspect that the 
major problem for today's practical programmers is more often a problem of 
communication with the machine than discovery of an algorithm. 



See the Proceedings of the Third International Joint Conference on Artificial 
Intelligence, Stanford, California, August, 1973. 



The careful reader will notice that in this paper we are synthesizing 
minimal incompletely specified machines as was done in [3, 4, 20, 21, 22 s 25]. 
In fact, our preprocessing stage where "difference vectors 11 are constructed 
has some resemblance to these earlier methods where "compatibility sets" were 
assembled in the first stage of the synthesis. However, our search technique 
is quite different from those of previous approaches and heavily uses the special 
characteristics of our problem to obtain maximum efficiency. The synthesis 
methods for completely specified machines such as described in [19] and [28] 
of course require no searching at all but they are not applicable to the 
problem discussed here. 
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II. A SYNTHESIS PROCEDURE 

Suppose an individual sitting at a display terminal executes an example 
computation using some kind of autoprogramming language: read into register 1, 
add register 1 to register 2, shift register 2 right one digit, and so forth. 
The contents of all of the registers are visible on the display and the results 
of each command are immediately updated. Assume that the sequence of commands 

* s *i>*2 , *2 , *2 # *2^1 : *l* *2 >^1 : *2 **2 ,C 1 : *1 > c 2 : halt . i^ and I2 represent 
instructions executed, and and represent the existence of some tested 

condition such as register 1 is positive or register 1 exceeds register 2. 

Then the task of the synthesizer is to produce the simplest possible program 

which is capable of executing this trace. 

Each instruction 1^ may occur several times in the same program so the 

k** 1 occurrence of 1^ will be denoted kl^ . Then the program can be constructed 

as shown in Figure 1 where the task is to find which occurrence RI.. in the 

program corresponds to each particular I_. in the trace. Let us assume that 

there will be a limit of L»4 or fewer instructions in the final program not 

including tests and branches so that the instruction set will be either 

" Ul lf ll 2 ,ll 3 } f ' {ll lt 21 lt ll 2f ll 3 >i {ll r ll 2 ,21 2 ,ll 3 }, or {ll^l^.ll^l^. 

(Note that the halt instruction is denoted Ij.) Thus each instruction 

1^1^ and I^ must appear at least once and there can be no more than L-4 

instructions. 

A detailed description of the example of Figure 1 is given in Figure 2. 
The pointer IND is advanced down the trace and each instruction execution 
Ij is associated with a particular kl^ in the program where k is set to the 
lowest number which has not been found to be contradictory. After the eighth 
step in Figure 2, the partial program of Figure 3(a) has been constructed. 
Notice that this partial program implies that the fourth instruction executed 
must bs llg- This is called a forced move and when it is entered by step 9 
of Figure 2 it is parenthesized. An infinite loop would result except that 

O 

£Pj£ condition is observed at the sixth entry in the trace followed by an 
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indicates a contradiction found. 



Figure 1. The example trace and the 

creation of the flow diagram. 
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IND *■ IND + 1. 

Indicate forced move parenthesized 
IND ••- IND + 1 

Give instrution I. the name II 
IND t- IND + 1 J J 

Indicate forced move parenthesized 
IND+-«- IND + 1 

Decrease IND to last unparenthesized move 



Increment name kl. to (k+l)I. 
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IND ^- IND + 1 
Give instruction I, 
IND «- IND + 1 
Decrease IND to last unparenthesized move 



the name II 



Increment name kl. to (k+l)I. 
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Decrease IND to last unparenthesized move 

Increment name kl^ to (k+l)I^ 

Decrease IND to last unparenthesized move 

Increment name kl^ to (k+l)I^ 
IND «- IND + 1 

Give instruction 1^ the name 11^ 
IND *■ IND !- 1 

Indicate forced move parenthesized 
IND «- IND + 1 

Give instruction I. the name II. 
IND «h IND + 1 J J 
Indicate forced move parenthesized 
IND «h IND + 1 

Give instruction I. the name II. 
IND -h IND + 1 J 2 
Indicate forced move parenthesized 
IND «- IND + 1 

Indicate forced move parenthesized 
IND +■ IND + 1 

Give instruction I. the name II. 
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Figure 2. Working the example of Figure 1. 
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Figure 3. The construction of the program 
for the trace of Figure 1. 
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instruction 1^. (See Figure 1.) This instruction is assumed to be 11^ in 
the program yielding the flow chart of Figure 3(b) • This flow chart is 
compatible with the first seven steps of the trace but it yields a contradic- 
tion at step eight implying that a previous decision was incorrect. So the 
pointer IND is decreased to the last move which was not forced (IND = 6) and 
instruction 1^ is given the new name 21^. This process is continued as detailed 
in Figure 2 until a compatible flow diagram is found as shown in Figure 3(c). 
If no four-instruction compatible flow diagram existed, backing up would 
eventually reach the beginning of the trace and the method could be tried again 
with L = 5. 

The reader should note that the condition before an instruction on 

a trace is included whenever it is checked and found to be true. It is omitted 

if either it is not checked or not true. Consequently, on a synthesized 

flow diagram, a particular instruction may have several transitions leading 

away, one blank and the others labelled C^, C^,...,^. In this case, each 

is assumed to be disjoint from if i I* j and the unlabelled transition is 

taken if none of C^^i • • • is true. 

The procedure described above is enumerative and will surely find a 

flow diagram compatible with the given trace if one exists with L instructions. 

If no solution is found, the method can be repeated as L is increased until 

one is found. After the first solution is discovered, the enumeration can be 

continued until additional solutions are found. This algorithm is similar 

to the one reported in [ 9] except that programs are modelleJd with Moore machines 

2 

here rather than with Mealy machines . This seems to be a more natural model 
and has led to many insights for improving performance. Sections III and IV will 
describe several techniques for pruning the search tree and greatly increasing 

i 

the algorithm efficiency. 
2 

In this paper we discuss flow charts and Moore machines interchangeably. 
Thus instructions represent states and flow lines represent transitions. 
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In order to make some of these ideas more precise, it is necessary to 
introduce additional notation. [1^] will be used to designate the ordered sfet 
{llj , 21^ , . . . ,KjI^, } of occurrences of 1^ in a synthesized program. For 
§xample 3 in Figure 3(c), [^^{l^t Z^}- K will denote the number of different 
instructions in the program and will denote the number of occurrences of 

the instruction I.. The condition C. and the instruction I. at the kth level 

J i ' j 

of the trace will be denoted and 0^, respectively. Thus in the trace of 
Figure 1, we have Ni = (blank), °i =I 1 » N 5 =c i>°6 =I l» and so forth. The cardinality 
of set S will be written |s|. 

The function of the synthesis algorithm is thus to 

(1) determine |t I j]| for j=l»2,...,K subject to the constraint .hat 

K 

Z | [I.] | is minimum, and 
j=l 2 

(2) determine which member of [(L ] corresponds to each specific 
occurrence of 0^ in the trace. 

To facilitate discussion of the above points, we introduce a selector variable 

(or state number) S J which is associated with the j th level of the trace. 

The selector variable points to the element of [0^] that corresponds to the 

occurrence of 0^ . Thus (2) can be restated: For each level j, determine the 

value of S^. One can check that in the final solution of Figure 1, S^l, 
2 3 4 5 

S =1, S =2, S =1, S =2, etc.. Since the synthesized program is to be deterministic, 

the assignment of a value to is subject to this constraint: If there are 

levels i and j such that, i< j , O^fQ^i* S 1 " 1 -^ -1 , N^N^, and 0^0^ , 

then S* must be the same as . If the previous condition is satisfied, we 

say that is forced ; otherwise it is arbitrary. Since the search changes 

the values of parameters dynamically, we will use the notation 'name'* to represent 

the current (perhaps transient) value of the entity 'name' . 

K 

The state count SK* is the current value of Z |[I.]*| and 
O j=l 2 

-11- 



the state limit L gives the maximum value for SK* permitted during the synthesis. 
12 1 

If S , S , .tv,S are Incorrectly assigned values, then a contradiction will occur. 
Assignment at level j causes a contradiction if either 

i-1 1-1 

(1) there is a level i,i<j, such that 0 i . 1 =0 : j. 1 » s =s > N i" N j» and 
either Oj^Oj or At is known that sS*S^, or 

(2) SK* becomes greater than L. 

An obvious synthesis technique is to simply try all possible assignments 
1 2 M 

of S , S ,...,S (assuming a trace of length M) for increasing values of L 1 
until a contradiction free (or compatible) assignment is found. Then the 
traasition set of the synthesized (or compatible) machine is 

{(S^~ 1 0j j^Nj.S^O^Il^jjeM}. The problem with a strictly enumerative technique 
is that as L-K becomes larger, the search space grows exponentially and the 
processing time for the synthesis becomes prohibitive. The next section will 
address the problem of limiting this search. 

Before leaving this section, one can study a second example of the 
synthesis technique by applying the method described above to the computation 
trace in Figure 2. When this is done, the program of Figure 4 results illustrating 
that the synthesis technique can be employed to create its own program. The 
program of Figure 4 does, in fact, correctly express the synthesis algorithm 
discussed above and will be the starting point for the treatment in Section III. 
The reader who takes the trouble to work out this second example will discover 
that the desired program is created directly without any search or backing up. 
This always occurs when the program being constructed has no duplicated instruc- 
tions, and we can even state a theorem: If the desired program has no duplicated 
instructions and if the given trace covers every transition in that program, 
then the program will be synthesized directly without any searching. Thus 
searching is only required to resolve the ambiguities which result from multiple 
copies of the same instruction in a program. This has important implications 
rn?^- for practical synthesis problems because typical programs usually have few 
duplications. 
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states L is exceeded 



IND «- IND + 1 



Print the solution 



Forced move 
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Figure 4. Flowchart for the synthesis algorithm. 
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III. PRUNING TECHNIQUES 

To make the synthesis process feasible it is nec«ssary to find a way to 
reduce the search space. Pruning techniques fall into two categories: dynamic 
and static. Static processing performs a ^structural analysis of the trace without 
considering the effects of assigning specific values to the selector variables. 
Dynamic processing is performed while the search is in progress and uses all assigned 
selector variables as well as a record of potential contradictions to avoid mistakes. 

Static processing, also called preprocessing , scans the trace and builds up 
nonequivalence information which is subsequently used Jry the dynamic phase of the 
synthesis. 

Preprocessing makes use of distinguishing sequences found in the trace to 

differentiate O-equivalent states. Let i and j be levels in the trace where 

9=0^ (assume i<j). If N.. °N« . for p=l,2,...,k and 0. , "0.. for q«0*l,..., 
i j i+p j+p i+q j+q 

k-1 and 0 1+ j^0j4fc then the states represented by S^Oj and S^Oj are k-nonequivalent. 
It then follows that S 4 "^. This information is kept in a difference sfet DV^ 
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associated with each level ft. DV^ is defined as the set: 

{i |*<i^l and 0^0^ and S^S^ }. 

We now use the difference sets to determine a lower bound or initial guess 

K 

L for each | [I. ]|. A lower bound for L is then given by Z L . L, is the largest 
K K k K l 

m such that there exist levels tt- 1 »^2 , '" , ^m ^l < ^2 < *" < ^ta^ having the following 
properties: 

(1) 0 fl =0 # =...=0 § »I and 

*1 *2 % m k 

(2) For i=2,3,...,m: l.€DV g for j=l,2, . . .,i-l 

We will now introduce the parameters required to describe dynamic processing. 

K 

The free state limit FL is defined as L- 



or more values of j . Level i is called 



2 L ± . If FL>0 then |[Ij]|>Lj for one 
tfe ^working level WL* if S^S 2 ,...^ 1 



i"H M 

have been assigned values but S , ...,S remain unassigned. The free state 

K 

count F* is given by F- Z max( | [I. ]*{-L. ,0) . Most of the actions of dynamic 

i=l ii 

processing are dependent on the contents of a failure memory FM. The failure 
memory may be conceptually visualized as an M by L matrix with each row FMj 
corresponding to a level in the trace. Each matrix element FM i j is a couple 
(W^jG^). W is called the structure factor and is called the free state 
factor. The effect each FM^ has on the search depends upon whether the element 



is valid , latent, or null. 



An element is valid if N^j^ and 



F*<G i j, an element is latent if W,^>0 and F*^^, an element is null if 
W^^O and an element is invalid if it is null or latent. A latent or valid 
element will become null if WL* ever becomes less than W^ . If FM^j is valid 
then S* may not be assigned the value j. Some of these definitions are summarized 
in Figure 5. 





F*c Gij 




w ij>0 


valid 


latent 


V 


null 




Figure 5. 





This therefore outlines the basic failure memory technique. For example, 

suppose it is known from other considerations that if at working level 9 the 

9 

selector variable S is set to value 2, then a contradiction will later result. 

Then the failure memory entry at FM^ ^ will be valid, and during the search, 
9 

S may be set to 1,3,4, etc. but it should never be set to 2. Clearly if large 

numbers of iailure memory entries are valid, the number of alternative settings 

for each will be greatly reduced and so also will be the total size of the search. 

9 

Suppose also that (3,4) is the entry in S which is valid. This means that the 
9 

above assertion (S *-2 will cause a later contradiction) is known to be true 



if 



(1) no changes have been made at level 3 or above since the entry (3,4) 

12 3 

was made, i.e. S ,S ,S are unchanged, 



and 



(2) there are fewer than 4 free states currently available, F*<G^ 

The entry (3,4) may be interpreted as an indicator of the cause of failure. The 

9 12 3 

setting S »2 will yield a contradiction as long as S ,S ,S are unchanged and the 

number of available free states is less than 4. The pruning technique therefore 

involves limiting the search by keeping the failure memory updated with as many 

valid entries as possible. 

Dynamic processing makes use of static nonequivalence data by adding information 

to the failure memory. Whenever the selector variable at level % is assigned 

the value j, the following procedure is performed: 

For all i € DV ft do; if FM^ is invalid then PM^ 3 (Jt,FL+l); 

fl i 

In other words, if S is given a value j and S is known from preprocessing to be 

nonequivaient to S*, make an entry into FM^ which will prevent the assignment 
c i Notice that the unconditional nature of static nonequivalence information 

is inherent in the failure memory entries made above since the free state 

factor FL+i insures that FM^ is either valid or null. 

erJc -lfi- 
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Since the failure memory is dynamically changing it is possible for two 
statically equivalent levels to be dynamically distinguished. For example, let 

i and j (i<j) represent levels in the trace where 0^°^ ,N i+l* N j+i» °i+l*°j+l 
and N i+ 2^ N j+2* clearl y» preprocessing is incapable of distinguishing S* and sK 
Suppose that dynamic processing assigns 8* + ^* the value k and ^(j+jj^ * s or becomes 
valid; then sS*S j . 

Dynamic nonequivalences are detected by using couple classes . A couple class 

[INI] is defined to be [I N I ]» {i |l<i<M,0. =1 ,N «N ,0=1 }. 
1 p q s J 1 p q s J 1 1 ~ l-l p* i q* i s 

If i and j are members of the same couple class then we say that level i and level 
j belong to the same couple class. Suppose levels i and j (i<j) belong to the 
same couple class then (a) if S**" 1 ^" 1 then S i= S^ (i.e. S"* is forced) and (b) 

ii it is known that sS*S^ then S* S*S* 1 . Hence, couple classed give dynamic 
processing an efficient way to detect forced moves and to propagate dynamic 
nonequivalence failure memory information. Selector variable S^ is forced if 
and only if 

(1) |[0 j _ 1 N j 0 j ]|>l > 

(2) there is a level i,i<j, such that i€ t°j-i N j°j3 
and (3) S 1 " 1 ^" 1 . 

In order to implement information from (b) above, ^(j^^) s** 1 can ^ e u P^ ate< ^ 
as follows if it is currently invalid: FM^^ gi _i is set to (WL*,q) 
if (1) levels i and j are in the same couple class, 
(2) S^n 

and (3) FMj n contains the nonnull couple (p,q). 

We will now turn our attention from nonequivalence induced contradictions to 

contradictions arising from violation of the state limit L. The assignments 
1 2 WL* 

to S ,S ,...,S are said to violate the state limit if they collectively 
imply that the number of states in the final synthesized machine will be greater 
than L. If the state limit L is violated then a fence will occur at some level 
in the trace. An incipient fence exists at level & if for all i<| » 
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FM^ is valid. An incipient fence implies that a new element must be added to 

[0 a ]* (or in other words, that a new state must be created). If O.-L and 
» A k 

U I | c ]*l>? J k and F*>0, then F* must be decremented by one since the new state has 
"committed" a "freestate" to be an element of A fence exists at level I 

if an incipient fence exists there and | [1^]* |>L k and F*=0 . A contradiction 
immediately follows from a fence since it implies that a needed new state 
cannot be created. 

Dynamic processing uses the concept of the pseudo-assigned selector 
variable to further enhance its contradiction detection mechanism. 1 Selector 
variable S which is to be assigned a value later in the search is said to 
be pseudo-assigned the value n if validation of FM causes a fence at level i. 
That is, FM^ is valid for all possible i except i»n so S*=n will be the only 
possible assignment at the later time whn WL**5- Therefore, if there is a j 
with the properties WL*<j<fi and i is in DV^ , then ftie assignment S^=n would 
result in a fence at level 5 . Therefore failure memory entries are entered 
in column n for all such j: FMj n =(WL*,l) if WL*<j<jl and i is in DV ^ . The 
free state factor 1 insures the entries will be valid since the existence of 
the potential fence implies F*=0. In addition to these failure memory entries, 
pseudo-assignment at level S also causes entries at levels greater than i 
in the same way as an ordinary assignment at level JL 

If the assignment of the value n to causes a contradiction then dynamic 
processing must perform a f ixup or a backup. The fixup operation unassighs 

(thus causing WL* to be momentarily decremented) and then reassigns the value 

ft 1 

n+1 to S . The fixup operation cannot take place if S is forced or if the last 

assignment to created a new state or if validation of FM^ n causes a fence. 
When is reassigned n+1 a failure entry must be stored in ^^ n " Before we 
describe how this entry is built, we must introduce the concept of usage infor- 
mation. 

Every decision made during dynamic processing is dependent on the assignment 
!M£of values to certain selector variables. For example, a forced assignment to S* 



depends on the values S 9 S* , and where ft, and j are in the same couple 

class. By maintaining a record of exactly what selector variables contributed 

to every assignment or failure memory entry it is possible to update the failure 

memory during fixup and backijp. Now we are ready to give an intuitive definition 

of the structure factor. The structure factor W* is the greatest j less than or 

equal to WL* such that contributed to the decision presently being consumated. 

Whenever WL* is decreased to j all failure memory entries (p,q) with p>j become 

(0,0) and all side effects (except usage information) of nulled failure memory 

entries and previously assigned selector variables at levels greater than j are 

removed. Thus the states of all entities other than usage are restored to the way 

they were when WL* was last equal to j. Therefore whenever a fixup occurs at 

level i as described above, the entry FM^ n is set to (W*,F*+1) 

Backup is initiated from level I If: 
o 

(1) 8 is forced and FM^ g l is valid or 

(2) causes a contradiction and represents a new state or 

(3) validation of FM^ g will create a fence . 

The function of backup is to find the greatest j such that (1) contributed to 
the contradiction causing the backup and (2) it is possible to apply a fixup 
operation on . Backup operates by repeatly setting WL* to W* until a fixup 
operation can be applied. Failure memory entries may be created at levels 
bypassed by backup. The entries are of the form (W*,F*+1) and are located on all 
levels i (in the appropriate column) such that W*>j . If backup causes WL* 
to become zero then there is no L-state machine compatible with the trace. In 
this case, all limit variables are increased by one and the search begins again. 
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I 



In order to illustrate the operation of the algorithm, we will give a 
"blow by blow" account of an example synthesis by tagging all dynamic entities 
with an event number. These tags indicate the creation order of the entities. 
When an entity is redefined, we will simply write the new value to the right of 
the old one so that a complete history of the synthesis is available when the 
algorithm terminates. 

This example and a second example in Appendix 1 each consists of two 
tables. The EVENT TABLE provides commentary of each event. The TRACE TABLE 
provides a representation of all entities, static and dynamic, used by the 
algorithm. The TRACE TABLE consists of the following columns: 

Level - - - - a level number 
0^ ----- an output symbol 
% ----- an input symbol 

Couple-class - - the couple class that contains the level 
Difference set - the set DV^ eVe ^ 

1 evel 

Selector values - the integer values that S takes during the synthesis 

FMfc-j _ - - - this column represents column j of the failure memory 

matrix. An entry (Wij>Gij) is represented by the 

notation ^ijgjj 

Note that the string 'entity' is tagged by writing it as 'entity' (event number). 
The difference vector and couple class information is set up during the 
preprocessing and the numbered events refer to the dynamic phase. 
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Event<s) Level 



Discussion 



0 




1 


1 


2-10 


35,32,26,. 




20,11,9,6 


11,12 


2,3 


13 


4 


14 


32 


15 


6 


16,17 


29,23 


18 


20 


19 


11 


20-22 


35,29,26 


23 


9 


"4,25 


5,6 


26 


7 


27 


8 


28-33 


9-14 


34 


34 


35 


33 


36 


33,34 


37,38 


14,15 


39 


19 


40 


16 


41-42 


19-18 


43 


18 


44 


18,18,19 


45 


16 


46 


18 


47 


17 


48 


37 


49 


18 


50 


31 


51 


30 


52 


30,31 


53 


18 


54 


19 


55-74 


19-38 



Initiate synthesis, F=0, FL=0 

is assigned value 1 (assignment) 
Failure memory entry created at a level in DVj^ (DV^ resolution) 

assignment 

is assigned 2 since FM41 is valid 
DV4 resolution. 

FM53 is made valid since S 6 =3+ FM32 3is valid* fence at level 32. 

The first implication follows frolta DVg (look-ahead fence 

prevention at 32 from 6) 
DV^ resolution 

look-ahead fence prevention at 23 from 20 

11 

DV4 resolution, since S A is now pseudo-assigned the value 3, since 

^11 1 aud ^11 2 are valid and L A" 3 and F=0 
DVjj resolution performed since is pseudo-assigned 

DV4 resolution 

assignment 

must be assigned value 1 since' S^=l and 5,7 CC3 (forced move 
(denoted by dot) ) 
forced move 

Assignment/ forced move 

^34 3 * s va l*dated since 14 and 35 are both members of same couple 
class but 4 1 (dynamic nonequivalence via CCg) 

dynamic nonequivalence via CC7. This implies that | [S J | >1» but 
this is a contradiction since F=0 and Lg-1, so 

the assignment S^^«l is incorrect. Invalidate FM33 ^ ^34 3 

assignment 

dynamic nonequivalence via CCg 
assignment 

dynamic nonequivalence via CCg 

dynamic nonequivalence via CC9, since S^ 9 cannot equal S 2 (they're 
both pseudo assigned) and since level 19 and level 20 are in the 
same couple class it follows that S 18 4 S 19 . Since S 18 and S 19 
cannot be assigned 1 or 2 it follows that |[A]|>3, but this is 
a contradiction since F=0 and L A =3, so 

the assignment s!6=l is incorrect. Invalidate FM^ ^, ^18,1* ^18 2 

assignment 

dynamic nonequivalence via CC9 

assignment 

DVjj resolution 

assignment 

dynamic nonequivalence via CC^g 

dynamic nonequivalence via CC2> This implies that |[P]|>1, this is 

a contradiction since F=0 and Lp=l 
the assignment S^ 8 is incorrect. Invalidate FM31 2 anc * ^30,1 
assignment 

dynamic nonequivalence via CC9 
assignment/ forced move 9 synthesis complete 



Figure 6. Event table for example synthesis. 
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I - {A.P.S.R.T} 



Level 

1 

2 

3 

4 

5 

6 

7 

8 

9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 
28 
29 
30 
31 
32 
33 
34 
35 
36 
37 
38 



Ni 



a 

A 
N 
R 
N 
R 
B 
N 
R 

R 
ft 

& 

JL 
& 
* 
R 
B 
K 
R 
D 
St 
R 

a 
» 

R 
Y 
N 
R 
D 
N 
R 
Y 



A 
P 
A 
A 
R 
A 
R 
A 
A 
S 
A 
S 
A 
A 
A 
A 
A 
A 
A 
A 
R 
A 
A 
S 
A 
A 
R 
A 
A 
R 
A 
A 
St 
A 
A 
R 
A 
T 



Couple-class 

.+ 

CC- 

cc 2 

• 

CC- 
CC, 
CC* 

CC4 

cc* 
cc^ 
cc* 

CC 6 

cc£ 
cc fl 

CC 9 

cc„ 



10 



cc; 

CC 
CC 

cc 

cc l 

cc c 

CC" 
CC, 

cc 
cc c 

CC, 



CC. 
CC 10 

cc;: 

CC?. 
CCg 
CC 

cc; 

4 



Couple class naming for couple classes 
with cardinality greater than 1. 

[A-N-P] = CC. = {2,30} 

[P-R-A] = CC, = {3,31} 

[A-N-R] = Ccf = {5,7} 

[R-R-A] = CC 4 = {6,8,22,28,37} 



Difference Set 
{35,32,26,23,20,11,9,6,4} 

{32,29,23,11,9} 
{32,29,23,11,9} 

{35,29,26,20} 
{35,29,26,20} 



{37} 

{32,29,23} 
{35,29,26} 
{32,29} 

{35,32} 

{37} 
{35} 



[A-B-A] = CC = 

[A-N-S] = CC£ = 

[S-R-A] = CC* = 

[A-P-A] = CCg = 



{9,23} 

{10,12,24,33} 
{11,13,25,34} 
{14,26,35} 



+ a • means the couple class has cardinality 1 
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Selector Values FM #2 FM #3 Level 



KD 

1(11) 

1(12) 



1(24) 



.1(26) 



1 
2 
3 



2(13) 1^(10) 4 



5 



2(25) 1(9) 4.(15) 6 



7 



.2(27) 8 

3(28) 1 (8) 4.(23) 9 

1(29) 1 1 10 

3(30) 1(7) 4.(19) 11 

1(31) 1 1 12 

3(32) 13 

1(33) ,2(37) 14 

1(38) 15 

1(40) 2(45) 16 

.1(47) 17 

2(49) 3(53) 16 1 (42),0(44),16 9 (46) 16. (43) ,0(44) 18 

1(55) 16^(41), 0(44) 15!(39) 18,(54) 19 

.2(56) lf(6) 1 4^(18) 20 

•K57) 1 1 21 

•2(58) 22 

.3(59) l x (5) 4.(17) 23 

•1(60) 1 24 

•3(61) 25 

.2(62) 1.(4) 4.(22) 26 

•1(63) 1 1 27 

•2(64) 28 

1(65) 4.(16) 4.(21) 29 

.1(66) 18. (51) ,0(52) 1 1 30 

.1(67) 18^(50), 0(52) 31 

•3(68) if (3) 4.(14) 32 

•1(69) 14}(35),0(36) 1 33 

•3(70) 1 14. (34) ,0(36) 34 

.2(71) 1.(2) 4^(20) 35 

•1(72) 1 1 36 

.2(73) 17.(48) I 37 

1(74) 1 38 



[A-X-A] = CC Q = 
[A-Y-A] = ccJ Q = 



{15,16,17,19,20} 
{18,32} 



I; 



ERJ.C 



Figure 7. (Continued) 
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IV. A PROGRAM FOR COMPARISON 



Theoretical methods do not exist for evaluation of our pruning techniques 
and so 1% was necessary to do experimental comparisons with other algorithms. 
A program was written to execute a basically enumerative algorithm which 
included no sophisticated preprocessing or pruning features at all. The program 
was written largely in the spirit of the diagram of Figure 4 except that a 
technique for processing multiple traces in parallel was included. Whereas, 
the algorithm of the previous section handles multiple traces by concatenating 
them head-to-tail to make one long trace, a simpler algorithm can utilize the 
several traces better by conceptually laying them side by side and working on 
them simultaneously. 

Parallel processing is achieved by using the algorithm already given 
applied to a subset of the traces. The choice of the subset for a given stage 
is illustrated in Figure 8. The current levels of the traces in that figure are 
2,2,1 where the instructions are l^Z^l^ followed by ^2 9 ^2 9 ^2 9 res P ectivel Y • 
The lowest instruction followed by the lowest condition (by some ordering) is 
chosen along this frontier and the next guess is made for the associated traces. 
In this case, 1^ followed by is lowest indicating that the next guess will 
be for trace 3, the second level. At the next stage all the levels happen to 
be 2 and the subset chosen for consideration consists of traces 1 and 2. (The 
frontier instructions are ^2**2**2 anc * next conditions are C2,C2»C,j.) 

After a subset of the traces is chosen, the algorithm is then used to add 
the appropriate transition and perform as many forced moves as possf^^Hon these 
traces. As before if a contradiction occurs while forcing or attempting to add 
a transition, then a backup is done until some transition can be changed. Thus 



again in Figure 8, at stage 3 when traces 1 and 2 are considered, a contradiction 
occurs when a transition is attempted to be found for followed by and 
backup must occur. 

The advantage of parallel processing is that if a transition which has been 
added would lead to a contradiction it would likely be discovered sooner than 
by the sequential processing. The latter might go through several traces 
completely before the same contradiction could be found. Consider, for example, 
the contradiction mentioned above between traces 1 and 2. An experiment 
measuring the usefulness of this parallelism is described in Appendix 2. 

Trace 
3 

h 1T 1 

c 2 i 2 

C 3 T l 

* 
* 
* 

Figure 8. Parallel Processing Example 

The comparison algorithm should then be thought of as probably the best, 
possible algorithm that can be obtained without much sophistication. 



Trace Trace 
1 2 



c 2 h c 2 i 2 
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V. THE EXPERIMENTS 



The performance of the two algorithms described here was compared with that 
of the original paper , Biemann [92. Turing machines were to be learned as 
shown in column one of Figure 9 where the three letter triples on each transition 
indicate the symbol read, the symbol printed, and the head move direction, 
respectively. As in the original paper, Mealy machines are shown although in 
our recent experiments thiiir Moore equivalents were used. Column two indicates 
the sets of traces input t:o the algorithms by giving the initial tape associated 
with each trace. Each trace can be reconstructed by applying the proper Turing 
machine to the given initial tape with the head initially placed at the leftmost 
nonblank symbol. In terms ef the notation of this paper, a transition ABL, 
for example 9 is transcribed as a tested condition N^=A and a resulting executed 
instruction O^-BL, print a B and move left., An examination of these results 
indicates that the recent programs are much faster than the original system but 
part of this improvement may be due to a switch from compiled Stanford LISP 1.6 
to FORTRAN. (The machine in each case was a DEC PDP-10.) The second 
conclusion is that on these relatively easy problems, the enumerative parallel 
algorithm is approximately as fast as its more sophisticated counterpart. 
This is largely because the enumerative algorithm wasted no time on preproces- 
sing or failure memory overhead and the amount of backing up was not large 
enough to become catastrophic. 

As a more serious test of our system, it was decided to attempt to 
synthesize a universal Turing machine of the style of Minsky[24l, Chapter 7. In 
particular, it was proposed that if our system were led through a simulation of 
the two state Turing machine in Minsky's example 7.3, it should type out a copy 

O 

gPj£" of Minsky's universal machine. After some experimentation, it was found 
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Turing Machine 



BXR 




Set of traces 



{ AXA, AXB, BXA 
BXB, AAXA, AAXB , ABXA 
ABXB, BAXA, BAXB 
BBXA, BBXB, AXAA} 



Enum. 
Algor . 



.15* 




AAR 



{ ABXBA , AXB, BAXAB} 



{AAA, AAB , ABA, ABB 
BBB.BBA.BAB.BAA} 

{ AAA, AAB , ABA, ABB 
BAA.BAB.BBBA} 

{ 3AAABBBAABAB , 
AABABBABAAB} 

{ A,B , AA, AB ,BA,BB , 
AAA, AAB, ABA, ABB, 
BAA^BABjBBB} 

{ABA, BAA, ABB, BAB, 
BBA, AABjAAABBB} 



ABL L \ ABR 



BAR 



AAL 



BBR 



{A, B, AA, AB, BB, 
AAA, AAB, ABB, BBB, 
AAAA, AAAB} 



{ AABB.B, AAABBB } 



.05* 



1.15 



1.12 



3.08 



1.70 



1.30 



.20* 



,10* 



With 
Pruning 



.23* 



.07* 



.45* 
.42* 
2.18* 
.70* 

.63* 



.22* 



.17* 



From 
Ref . [9 ] 



13.13 



22.58 



** 
317.65 



** 
1106.60 



151.9* 



** 

96.0* 



* Alternate but correct solution found. 
** Input traces shown are slightly different from those in [9"]. 

Figure 9 . Synthesis times for several Turing machines in seconds. 



that this can be accomplished if the universal machine is separated into four 
subroutines Cas Minsky has done) which can each be synthesized individually. 
The first experiment of this type yielded routine 4 synthesized properly, 
routines 2 and 3 synthesized with errors, and an unending search (more than 
several hours) for routine 1. A study of the results of this experiment 
indicated that the example was too simple to yield a completely general 
universal machine and that the two leftmoving states in routine 1 were too 
similar to be distinguished by our system in a reasonable amount of time. 
Consequently, a slight modification was made to Minsky f s example to force the 
system to produce a completely general machine, and one of the leftmoving states 
in routine 1 was moved into routine 4 so that it could be synthesized independ- 
ently. With these changes the universal machine of Figure 10 was constructed 
easily by our best algorithm as shown in Figure 11. An instruction L or R 
means move left or right while any other instruction I means "print I". All 
transitions of the form iL-C-iL or iR-OiR have been omitted from Figure 10 for 
the sake of readability. Thus if the head is moving left when a symbol C is 
read and no corresponding transition is shown, then the same instruction L is 
executed again. The only item missing from Figure 10 is the transition to the 
halt instruction which is never encountered in this infinitely long example 
computation. 

This universal Turing machine was constructed from a computation 
simulating the two state Turing machine shown in Figure 12. The transitions 
of the machine to be simulated are coded in the form Q S Q f S f D where 
Q is the current state, 
S is the symbol read, 
Q 1 is the next state, 
S f is the symbol printed, and 
D is the direction of the move, 
0 for a left move and 1 for a right move. Thus the transition in Figure 12 from 
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Routine 


Number of 


Number of 
iransitions 


NUIQDcF OI 

Simulated 
moves 
Required 


lOtal 

Length 
oi input 
Trace 


P"DTT 4- 4 «m a 

Lru time 
to synthesize 
routine 
(Seconds) 


1 


11 


30 


6 


846 


25,23 


2 


10 


34 


4 


593 


9.43 


3 


13 


36 


6 


455 


9.78 


4 


15 


36 


6 


321 


11.93 



Figure 11. Times for synthesis of the four routines 
of the universal Turing machine 




Simulated Machine 



current state of simulated machine followed 
*r by symbol being currently read 

0000M000Y011X0111010X1000111X0100101X1011000Y 
i , f « ■ — ' 

Simulated Transitions of the simulated 

tape machine 



Initial tape for the simulation 



Figure 12. The universal Turing machine was constructed 
on the basis of six simulated moves of this machine. 
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state 01, reading symbol 1, going to state 10, printing symbol 1, and moving 
left is coded 0111010 and given to the universal Turing machine in that form. 
The initial tape for the example computation which yielded the universal machine 
was as shown in Figure 12 with the tape to be simulated, the current state and 
symbol read, and the transitions to be used included. M marks the location of 
the simulated head and the X T s and Y's serve as markers to separate the 
sequential items on the tape. We can conclude from this experiment that the 
four routines of the universal Turing machine can be constructed from one 
sample computation involved six simulated moves and the total CPU time required 
for the synthesis is less than one minute. This synthesis was completed with 
our best algorithm using preprocessing and pruning techniques, and none of the 
four routines could ever be found by our purely enumerative program even after 
searches of 45 minutes or more. 

The performance of a system can sometimes be best judged if it is studied 
over a class of examples rather than a few individual cases as we have above. 
Toward this end, we will introduce a grammar-generated class of schema and 
examine the performance of our programs over a range of problems. 
Definitions: 




is an operator which concatenates a program with itself 



i times. C. operation on the null program yields a 



sequence of i copies of the instruction A. 



L 



is an operator which adds a transition to a program leading 



from the last state to the first state. The condition 



associated with this transition is different from any other 



associated with the last state. 



Q 



is an operator which works on instructions. If instruction 



A has any labelled transitions leading away, then Q(A)=Bi 



where Bi is an instruction different from any other instruction 



in the program. Otherwise Q(A)=A. 
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S(i,j>k) is the program which results if Q is applied to every 
instruction in 




One of the members of this class, S(3,2,2), appears in Figure 13, and the synthesis 
times for a whole range of these programs are given in Figure 14. Each solution 
was constructed from one trace which was generated as if each loop was an ALGOL 
FOR-loop with an index advancing through values i=l,2. If each loop is 
executed n times every time it is entered, the length of the trace will ba 



to cause a memory overflow. 

Figure 14 seems to indicate that problems of the type S(i,j ,k) should be 
done enumeratively , and that the large overhead associated with preprocessing 
and maintenance of a failure memory is not worthwhile. However, if we make the 
problem more difficult by equating all of the B^ f s in each S(i,j,k), the 

conclusion is reversed. (B=B^=B2- ) Figure 15 shows the results of this 

modified experiment where the more sophisticated program produced some answers 
easily that could not be found otherwise after even 45 minutes of searching. 

In conclusion, we offer one example of the type of operation that we will 
expect from our autoprogrammer system. A user seated at the display terminal 
wishes to sort the sequence of N=4 integers 3,2,1,4 which appears on the screen. 
He wishes to use a selection sorting method, and he has a number of commands 
available on the screen which can be referenced with the touch of a light pen. 
He sets a pointer I to the first item, 3, a pointer J to the second item, 2, 
notes the two are out of order , interchanges them, increments J, and so forth. 
With proper hardware and software* we are hoping he will be able to complete 
the sort in a minute or two. As he goes through these steps, the autoprogrammer 
will store the sequence of actions shown in Figure 16 and then will output the 
indicated program. A report on this system will be soon forthcoming by 
Biermann and Krishnaswamy [13] . 




Omissions in Figure 14 occur because generated traces were long enough 
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Figure 13. S(3,2,2) 
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I 



i 


j 


k-1 


k-2 


k=3 


1 


1 


.00(.02) 


.02(.00) 


.OO(.OO) 


1 


2 


.OO(.OO) 


.02(.02) 


.09(.03) 


1 


3 


.00(.02) 


.09(.03) 


2.32(.23) 


1 


4 


.02(.0G) 


.67(.12) 




2 


1 


.OO(.OO) 


.04(.02) 


.34(.15) 


2 


2 


.OO(.OO) 


.22(.03) 


16.39(1.22) 


2 


3 


.03(.02) 


3.04(.15) 




2 


4 


.02(.02) 






3 


1 


.OO(.OO) 


.17(.23) 


5.92 


3 


2 


.04(.02) 


2.49(.73) 




3 


3 


.07(.02) 


(2.65) 




3 


4 


.20(.03) 






4 


1 


.00(.02) 


.65(7.48) 




4 


2 


.0S(.02) 


(76.70) 




4 


3 


.17(.03) 






4 


4 


.52(.03) 







Figure 14. Synthesis times in seconds for schema Sti,j,k) using the 
program of Section III. Times for the purely enumerative algorithm 
are parenthesized. 
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1 j k=2 k=3 

2 1 .05(.08) .62(45.55) 
2 2 .28(1.25) 6.15 

2 3 2.58(20.22) 

3 1 .73(16.3) 

3 2 122.95(> 45 min.) 

4 1 8.08(> 45 min.) 

* Alternate solution found because of Inadequate trace. 



* 



Figure 15. Synthesis times in seconds for schema S(i,j,k) after all B^s have 
been equated. Times for the purely enumerative algorithm are parenthesized. 
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A(I)>A(J) 
A(I)>A(J) 

J>N 

A(I)>A(J) 
J>N 



J>N 
I=N 



I-f-1 

J<-I+l 

A(I)+A(J) 

J+J+l 

A(I)+A(J) 

J-<-J+l 

J+J+l 

I+I+l 

J<-I+l 

A(I)+A(J) 

J+J+l 

J<-J+l 

I+I+l 

J-f-I+1 

J<-J+l 

I+I+l 

HALT 



A(I) > A(J) 




The Trace 



The Program 



Figure 16. An autoprogrammer example. Constructing a selection sort program. 
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Finally it should be said that the computation times given in this 
section should be considered to be indicators of comparative algorithm 
efficiency and not absolute measures of any kind. Any of the programs dis- 
cussed here could obviously be improved and the times given indicate what 
can be attained with moderately careful programming. 
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VI. CONCLUSION 



The experiments described here demonstrate that programs of significant 



complexity can be synthesized in reasonable time from instruction traces. We 
do not expect that our autoprogramming system will be greatly burdened by 
long waits for the completion of a synthesis. However, it may be that this 
work will have broader significance because it provides a benchmark for those 
individuals who are interested in program synthesis from much weaker 
information. If our system with its very complete input information has 
difficulty in doing a construction, one can expect a vastly more difficult task 
in doing the synthesis from simply input-output pairs, sparse traces, or formal 
specifications on desired performance. 
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APPENDIX 1 

We will give here another example in the style of Section III. In 
this synthesis, we see the algorithm search for and fail to find a solution. 
Then it creates a free state and succeeds. 
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Event Level Discussion 

initiate synthesis, F=0 and FL-0 
assignment 

DVg resolution, and are now pseudo assigned 

look ahead fence prevention at 17 from 8 
assignment 
DV5 resolution 

dynamic nonequivalence via CC-^, this implies that|[A]|>l, 

this is a con tradi ction , so 

the assignment S -1 is incorrect. Invalidate FM-^q ^ and FMn 1 
assignment * 
DVj resolution 
assignment 

is forced to be 2, this is a contradiction since FMg 2 ^ s 
valid, so 

17 Initiate backup: s7 and cannot be changed since Le=Lc=1 and F=0 

s5 cannot be changed since Lg=2 and F=0 
S cannot be changed since L^=l and F=0 

cannot be changed since this level represents 
the first occurrence of [B] and so changing 
to 2 would simply be a renaming of an existing 
state. 

and sl cannot be changed since L^=Lg=l and F=0 
At this point we have backed up to the first level of the trace so 
Invalidate all failure memory entries, set FL=1, F=0 and initiate 

synthesis again 
assignment 
DV3 resolution 
assignment 
DV5 resolution 

dynamic nonequivalence via CC^, since this implies | [A] | >1 we must 

use available free state to allow |[A]|=2. Now F=0. 
since F is now zero level 17 (or 14) can be pseudo assigned the 
value 2, thus FMg 2 is made valid via the look ahead fence 
prevention mechanism. 
28-42 6-20 assignment/forced move, synthesis complete 
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Figure Al. Event table for example synthesis. 
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APPENDIX 2 

In order to study the effect of parallel processing in our enumerative 
algorithm, a large number of tests were run on the second problem in Figure 9 . 
Figure A3 shows the results of these experiments where the number of modes 
expanded on the search tree are plotted versus the number of input traces with 
the total of the lengths of the traces indicated beside each point. The times 
for most of the runs (including input-output operations) were in the range of 
2 to 5 seconds op a IBM 370/165 with the longest run requiring 9.5 seconds to 
do a synthesis from 60 traces. Thus the overhead involved in the parallel 
computation becomes significant indicating that there is probably an optimum 
number of traces for synthesizing this machine. It can be shown that the Moore 
form of this machine cannot possibly be constructed from only one trace, and 
considering the uncircled points only, two or three traces did not yield a 
quick synthesis. However, four or five traces seemed to be quite sufficient 
and additional traces helped none whatsoever. 

The circled points indicate that a second (equivalent) version of the 
Turing machine was synthesized and this occurred if most of the traces processed 
two or more B's before processing A's. Thus the performance was heavily 
dependent on the first few transitions of each trace. 
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Figure A3. The usefulness of parallelism in enumerative processing. 
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I, INTRODUCTION 

An au top rog rammer is an interactive computer programming system which 
accepts as input example calculations and which yields computer programs for 
doing those calculations. Such a system provides the user with a sort of 
scratch pad and command system for executing the examples, and it synthesizes 

programs on the basis of a recorded history of the steps required to do those 

i 

computations. This paper will briefly describe an au top rog ramming system 
which is currently under development by the authors. The reader will find 
more details in [1], [2], and later reports. 

t 

i 
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II, THE LANGUAGE 

Every program created by the system is a subroutine with a name which 
can be called from any subroutine including itself or which can act as a 
top-level main program. A subroutine is created by 

(1) declaring the name of the subroutine, 

(2) declaring the data structures (arrays and variables) to be refer- 
enced including information about which are arguments for the 
subroutine, which will be held in common with other subroutines, 
and which variables will be pointers into arrays, and 

(3) executing one or more example calculations using the aut op rog ramming 
language. 

After steps (1) and (2) above are completed, the declared arrays and variables 
appear on a display screen properly labelled and with all entries set to 
zero. Labelled arrows appear in the arrays resulting from pointer declar- 
ations in (2). At this point, a sample computation can be performed. A 
light pen at the display screen is used to reference the commands of the 
language which also appear on the screen and their operands among the data 
structures . 

Let p^ stand for the i-th operand touched by the light pen after a 
command is referenced. Then the commands of the language are: 

S start - the first instruction in a subroutine. 

H halt - terminate computation and return to the 

calling routine. 

+ add - p 3 «- P1+P2 or p 2 * Pl + P2" 

subtract - p 3 «- P 2 ~Pl or p 2 «- P^P-l 
* multiply - P3 «- Pj*P2 or P 2 Pi*P2' 

/ divide - p 3 «- ? 2 ^1 or p 2 p 2^ p l' 
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M move - Pj/ 

R read - gets the next integer typed on the teletype. 

If two opetands in an array e and e are 

i,j m,n 

touched, all entries e in the array are read 

x >y 

such that i i x £ m and j y<, n. 
W write - outputs integers to the teletype using conventions 

as in read, R. 

T type - outputs a specific character string to the teletype. 

C call subroutine - p^ is the subroutine name being called 

and p^ for i > 1 are the arguments to be referenced. 

? conditional - note that the relationship P1P2P3 holds where 

P2 is either = ,>, or<. 
The only data type currently available is "integer 11 . Some of the commands may 
take varying numbers of operands and their interpretation depends on how many 
are given. For example, the result of an arithmetic instruction is left in 
p^ if three operands are given and in p^ if only two are given. The digits 

0,1,2, ,9 appear on the screen for the generation of literals. Multiple 

digit literals are generated by a sequence of light pen hits on these digits. 

One can best understand the operation of the system by considering how 
a few simple examples of a linear search might be converted into a program. 
Suppose we search the array A=(l*4.2) with N=3 elements for the elements J = 
1,2, and 3. Then we declare the array, enter some typical values (1,4,2), 
and use the light pen cormands to advance the pointer across the array until 
the item is found. If it is found, we return ANS <■ 1, otherwise ANS*~ 0. As 
each calculation proceeds, the system stores the sequence of actions performed 
as shown in Figure 1 and then 
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(A(1)=J) 
ANS •*■ 1 
ri 



J=l 



s 

1^1 
I «- I + 1 
I «- I + 1 

(A(I) = J) 
ANS ■*■ 1 
H 

J=2 



S 

1^1 
I «- I + 1 
I «- I + 1 
I+I+l 
(I > N) 
ANSf- 0 
H 

J=3 



The commands generated in searching array A for J = 1,2,3 

*" a(1) = j 



J 



1 ill I 



(^1 t- I + T~|^ , A(I) = J 



Otherwise 



I > N 
ANS ^ 0 1 



CO 



The shortest program compatible with the traces. 
Figure 1. Constructing a program for a linear search. 



1 ANS f 1 | 
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constructs the shortest program which is compatible with these traces. It can 
be shown that every possible program (or its equivalent) can be so constructed 
from a finite set of traces, and the art of such synthesis is described in 
some detail in [1]. In this case, the program appearing in Figure 1 is correct 
for linear searching and can be constructed from traces in a fraction of a 
second. 

If a program synthesized on the basis of one or several traces is tested 
and proven to be incorrect, one or more additional sample calculations may be 
input to remove the bugs. The synthesis algorithm is capable of program con- 
struction from pieces of traces as well as complete traces. This means that 
if a program is correct except for some small part, a portion of the calculation 
involving just that pavt may be completed and input as a correction to the 
program. The synthesized program is the shortest program which is compatible 
with the given traces and so is guaranteed to execute those examples correctly. 
Thus if the examples are correct, a program with errors will be constructed 
only if one exists which is compatible with the traces and has length less 
than or equal to the correct program. 
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The fact that subroutines can be called while executing a sample cal- 
culation means that large programs can be constructed using many smaller 
building blocks. A subroutine called in this way presumably exists from an 
earlier synthesis and is easily available. If it does not exist, the user 
is asked to supply the result that it would have yielded if it did exist! 
Thus the sample calculation can proceed whether or not the subroutines upon 
which it depends have already been constructed. Clearly, this leaves the 
possibility open for so-called "top-down 11 programming. 

All arrays and variables for all subroutines are stored in a hash 
table where the keys are computed from a combination of the name, the assoc- 
iated subroutine name, and the level of the subroutine call. This means that 

(1) arrays are effectively infinitely large as long as the hash table 
does uot overflow so that no dimensions are associated with arrays, 
and 

(2) subroutines may be called recursively. 

A "common" feature is available for transmittal of information between sub- 
routines . 

Another feature to be included will be the "continue" feature. If a 
sample calculation is quite long and repetitive, there is a good chance that 
part or all of the program can be correctly constructed before the sample 
calculation is completed. Then a "continue" key may be touched midway through 
the sample causing a program synthesis on just the portion of the trace which 
is available. Subsequent inputs through the "continue" key will each cause 
another instruction from the synthesized program to be executed. If the pro- 
gram yields an error, a "backup" key will be available allowing the last one 
or several instructions to be "unexecuted", the corrected instructions may be 
inserted with the light pen, and the calculation can continue. As an example, 
if a column of ten integers is to be added and then printed, the user may add 
two or three with the light pen to cause a program with a loop to be constructed. 

ERJC ^ 



Then he may hit the continue key repeatedly until all are added, and finally 
insert (again with the light pen) the instruction to do the printing. The 
process of trace construction should not, therefore, be thought of as simply 
a long process of light pen work. The light pen will be used alternately 
with the continue key and backup key to allow the easiest and most carefree 
possible completion of examples. 

A great deal of effort has gone into making the system easy and comfor- 
table to use. The user may input instructions at any rate he desires without 
significant delays for internal processing. Each hit of the light pen is 
acknowledged by a momentary (one second) disappearance of the designated image. 
Erroneous light pen hits which result in illegal instructions cause a message 
to be output at the teletype, the illegal instruction is deleted, and the user 
may repeat the instruction without any special action. The sensitized points 
on the screen are well separated so that incorrect hits are not common. Sub- 
routines may be created or called at any time in a carefree manner. If in 
the process of doing a computation it is desired to declare a new array or 
pointer, this can be done at any time without interruption of the current proces 
It is hoped that the system will be so easy to use that anyone will be able to 
run it with less than an hour instruction. 
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III. THE CURRENT STATUS OF THE SYSTEM 

As already mentioned, this system is still under development and not all 
of the features given above are operative. The three major parts of the 
program, the light pen and display routines, the language interpreter, and the 
program synthesizer are all running individually and debugged. The program 
synthesizer has been extensively tested for several months as described in [1]. 
The coordinator of these programs and the interface routines are still only 
partially operative but should be completed within a few months. 

The photograph of Figure 2 shows a user with light pen referencing an 
array on the display screen. The display and keyboard at right are used for 
array declarations, input-output for the synthesized program, error messages, 
and so forth. Figure 3 shows the display screen after the declarations have 
been made for a matrix multiply calculation. The autoprog rammer instructions 
appear in a list at che right, and the ten digits for literal generation are 
at the bottom of the screen. Figure 4 shows the screen at the end of the 
computation, and Figure 5 gives the synthesized program produced after about 
50 milliseconds of internal synthesis time. Many other programs of similar 
complexity have also been synthesized by the system. 




Figure 2. The second author prepares to execute a matrix multiplication. 
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* END TRACE 



COMMANDS: 



*s 'a k h M 0 N 0 P 0 



+ ADD 



- SUB 
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|j,K 
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fN 




* MPY 



/ DIV 



MOVE 



TEMP 



0 



READ 



WRITE 



NOTE 



INSTRUCTIONS : 



DELETE 



DECLARE 



PAUSE 



LITERALS : 



0123456789 



Figure 3. The autoprogrammer display before 
executing a matrix multiply calculation 
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* END TRACE 



COMMANDS: 



0 j h K s M s 





N__ 
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50 



+ ADD 
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TEMP 



32 



LITERALS : 
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READ 



WRITE 



NOTE 



INSTRUCTIONS 



DELETE 



DECLARE 



PAUSE 



Figure 4. The display after the 

calculation has been completed 
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Read M 
Read N 
Read P 



I— 1 
K— I 
J-*- K 



Read A(l,l) to A(M,N) 
Read B(l,l) to B(N,P) 



I 



TEMP»A(I,J) * B(J,K) 
C(I,K)*- C(I,K) + TEMP 
J*- J + 1 



J > N 



otherwise 



J*- 1 

I + 1 



otherwise 



I > M 



1 

K-%- K + 1 



otherwise 



>K > P 



Halt 



Figure 5. The synthesized program for multiplcation of an M by N matrix times 
an N by P matrix. 
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IV, SOME PROBLEMS 

Light pens are inherently very slow devices and will never be adequate 
input devices for autoprogrammer systems, A more ideal hardware arrangement 
would be to have a touch-sensitive display screen mounted down flush with a 
desk top. Then the users fingers could leap around on the screen at the speed 
possible on a modern desk calculator inputting individual hits at the rate of 
several per second. There is every reason to believe that a person can think 
fast enough to achieve that rate which is nearly an order of magnitude faster 
than possible with a light pen. However, we feel that our light pen driven 
system will clearly show the value of the autoprogrammer idea and will consti- 
tute a high quality basis for further work. 

The size of the display screen is also an important limiting factor. It 
is desirable to have all the current data structures visible at one time but 
our screen is too small for the display of many arrays. This can be solved 
with windowing features, the use of multiple displays, and the dependence on 
subroutines, but all of these alternatives have difficulties. Windowing 
features are a bother for the user, multiple displays are expensive, and it is 
not always easy to hide a significant number of arrays in subroutines. Fortu- 
nately, programs can be synthesized on the basis of relatively small samples 
so that the size of displayed arrays need not be large. 

Finally, we are concerned about the ability of humans to produce correct 
example computations. Our system moves the debugging problem from the domain 
of language syntax and semantics to the domain of examples, but the possibility 
for human error remains. If the example calculation cannot be generalized into 
a correct program, certainly our system cannot succeed. For example, if we 
want a program which inputs a prime number and outputs the next sequential 
prime number, we cannot offer as a sample computation: input the integer 
11, add 2, print the result. The example calculation must execute the ins true- 



tions of some correct program in the order that the correct program would 
execute them. Only if this constraint is met will that program be found. A 
central theme of our current research is to weaken this requirement. 
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